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Abstract 

In this paper we consider the problem of wind energy production using a 
second order semi-Markov chain in state and duration as a model of wind 
speed. We present the mathematical model, we describe the data and tech- 
nical characteristics of a commercial wind turbine (Aircon HAWT-lOkW). 
Wc show how to compute some of the main dependability measures such as 
reliability, availability and maintainability functions. We compare the re- 
sults of the model with real energy production obtained from data available 
in the Lastem station (Italy) and sampled every 10 minutes. The compu- 
tation of the dependability measures is a crucial point in the planning and 
development of a wind farm. 

Keywords: semi-Markov chains, synthetic time series, dependability 

analysis 



1. Introduction 

Wind is one of the most important renewable energy sources. Wind en- 
ergy is produced by converting the kinetic energy of wind into electrical 
energy by means of a generator. For this reason it is important to dispose of 
an efficient stochastic model for wind speed changes. 
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First and second order Markov chain models have been extensively used in 
wind speed modelling and synthetic time series generation, see e.g. Shamshad 
et al. m, Nfaoui et al. [2], Youcef et al. [3J, Shain et al. ^ and Torre et al. 

The Markovian assumption has, especially in the modelling of wind speed, 
several flaws. In discrete time, waiting times in a state before making a tran- 
sition to another state are geometrically distributed and consequently the 
memoryless property applies. This leads to a great simplification of the 
model which is unable to reproduce correctly the statistical properties of the 
real wind speed process. 

Semi-Markov chains do not have this constraint, because the waiting 
time distribution function in the states can be of any type and this al- 
low the data to speak for themselves without any restriction. For this 
reason semi-Markov chains have been extensively applied to different fields 

D'Amico et al. p3] was the first paper where semi-Markov chains were 
applied in the modelling of wind speed. In that paper were proposed first 
and second order semi-Markov models with the aim of generate synthetic 
wind speed data. It was shown that all the semi-Markov models perform 
better than the Markov chain model in reproducing the statistical properties 
of wind speed data. In particular, the model recognized as being the more 
suitable is the second order semi-Markov model in state and duration. 

In this paper we show how to compute dependability measures as avail- 
ability, reliability and maintainability functions for the second order semi- 
Markov chain in state and duration. These indicators give important in- 
formation on the feasibility of the investment in a wind farm by giving the 
possibility to quantify the uncertainty in the wind energy production. 

Another important aspect is related to the location of the wind farm. In 
fact today much wind farm are built offshore for different reasons: the wind 
speed is more powerful and constant due to the absence of obstacles, the vi- 
sual, environmental and acoustic impact is cut down. The maintenance cost, 
instead, is higher than the onshore wind farm. A good stochastic model can 
help the planning of preventive maintenance suggesting when is suitable to 
do the maintenance operation. This is possible by analyzing what happens 
after a particular transition between two different states that is lasted for a 
certain time period. 

The results presented here are new and generalize some of the results 
obtained for semi-Markov chain of order one (see Barbu and Limnios [15] 
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and Blasi et al. [IS]). The model generalizes also Markov chains and re- 
newal models. We apply our model to a real case of energy production. For 
this reason we choose a commercial wind turbine, a 10 kW Aricon HAWT 
assumed to be installed at the station of L.S.I -Lastem which is situated in 
Italy. 

This paper is organized as follows. Section 2 presents some definitions 
and notation on the second order semi-Markov chain in state and duration. 
Section 3 describes the database and the technical characteristics of the com- 
mercial wind turbine. Section 4 shows the way in which it is possible to com- 
pute the dependability measures via kernel transformations and the value 
computed on the real data and on the synthetic data are compared. In the 
last section some concluding remarks and possible extensions are presented. 

2. The second order semi-Markov chains in state and duration 

In this section we describe briefly the second order semi-Markov chain in 
state and duration, see D'Amico et al. [H] for additional results. 

Let us consider a finite set of states £" = {1, 2, S'} in which the system 
can be into and a complete probability space {fl, F, P) on which we define 
the following random variables: 

Jn-.^^E, Tn-.n^^N. (1) 

They denote the state occupied at the n-th transition and the time of the 
n-th transition, respectively. To be more concrete, by J„ we denote the wind 
speed at the n-th transition and by T„ the time of the n-th transition of the 
wind speed. 

We assume that 

^Jn+i = j, Tn+i -Tn = t|cr( J5, 7^), J„ = k, J„_i = i, - T„_i = X, < s < n] 

= Pi^Ai+l = J; Tn = t\Jn = Jn-l — h = '.= xQi.kjit)- 

(2) 

Relation (|2| asserts that, the knowledge of the values J„, Jn-i,Tn — T„_i 
suffices to give the conditional distribution of the couple J„+i,T„+i — T„ 
whatever the values of the past variables might be. Therefore to make prob- 
abilistic forecasting we need the knowledge of the last two visited states and 
the duration time of the transition between them. For this reason we called 
this model a second order semi-Markov chains in state and duration. 

It should be remarked that in the paper by Limnios and Opri§an [17] 
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were defined n-th order semi-Markov chains in continuous time. Anyway the 
dependence was only on past states and not on durations. 
The conditional probabilities 

xQi.k,j{t) — — ~ Tn — t\Jn — k, Jn-1 — — Tn-l — ^] 

are stored in a matrix of functions q = {xli.kjif)) called the second order 
kernel (in state and duration). The element xQi.kjit) represents the probabil- 
ity that next wind speed will be in speed j at time t given that the current 
wind speed is k, the previous wind speed state was i and the duration in 
wind speed i before of reaching wind speed k was equal to x units of time. 

From the knowledge of the kernel we can define the cumulated second 
order kernel probabilities: 

xQi.k,j{t) '■— '^[Jn+1 — ji Tn+1 — ^ t\Jn — k, Jn—1 — h Tn — T^—l — x] 
t 

= ^xQi.kA^)- 

s=l 

(3) 

The process {Jn} is a second order Markov chain with state space E 
and transition probability matrix ^.P = 2.Q(oo). We shall refer to it as the 
embedded Markov chain. 

Define the unconditional waiting time distribution function in state k 
coming from state i with duration x as 

xHi,k{t) '.= P[T^-(_i Tn < t\Jn = k, Jn—1 = hTn Tn—l = x\ = ^ ^ xQi.kjjt)- 

(4) 

The conditional cumulative distribution functions of the waiting time in 
each state, given the state previously occupied and the duration of occupancy 
are defined as 

xGi,k,j{t) = P[T„_|_i — Tn, ^ t\Jn = k, Jn-1 = h Jn+l = ji Tn ~ 2^n-l = x\ 
xPl.kJ 

Denote by N{t) — sup{n : < e IN the number of transitions up 
to time t. The second order semi-Markov chain in state and duration can be 
defined as Z{t) = {Z\t),Z\t)) = (Jiv(t)-i, Jiv(t))). 
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If we define, Vi, k,j G E, and t G IN, the semi-Markov transition proba- 
bilities by: 

x(j)i.k,h.jit) ■= ^[JN{t) = j,JN{t)-i = h\Jo = /c, J-i = i,To = 0,To - T_i = x], 

(6) 

it is possible to prove that they verify the following system of equations: 

t 

x4>i.k,h.j(t) = l{i=/i,fc=j}(l — xHi.k{t)) + ^ ^ x<li.k,r{s) s4>k.r,h.j{^ " s). (7) 

rS-B s=l 

For this model more general duration dependent transition probabilities 
than ([7]) have been derived in D'Amico et al. [11]. 

3. Database and commercial wind turbine 

To check the validity of our model we perform a comparison of the behav- 
ior of real data and wind speeds generated through Monte Carlo simulations 
(the Monte Carlo algorithm is described in [18j) based on the model pre- 
viously explained. As made in our precedent works [HI [18] we use a free 
database of wind speed sampled in a weather station situated in Italy. This 
database consist of about 230thousands wind speed measures ranging from 
to 16 m/s and in a time interval of five years. To be able to model the 
wind speed as a semi-Markov process, the state space of wind speed has been 
discretized. In the example shown in this work we discretize wind speed into 
8 states (see table [T]) chosen to cover all the wind speed distribution (the 
states are described in the table). We apply our model to a real case of en- 
ergy production. For this reason we choose a commercial wind turbine, a 10 
kW Aricon HAWT with a power curve given in Figure [T] The power curve of 
a wind turbine represents how much energy it produces as a function of the 
wind speed. In this case, see Figure [T} we have no production of energy in the 
interval 0-2 m/s, instead the wind turbine produces energy almost linearly 
from 3 to 10 m/s, then, with increasing wind speed the energy production 
remains constant until the limit of wind speed, 32 m/s, in which the wind 
turbine is stopped for structural reason. Then the power curve acts as a filter 
for the wind speed. Through this power curve we can know when and how 
much energy is produced as a function of the wind speed at a given time. 
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Wind speed range m/s 


1 


to 1 


2 


2 


3 
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4 
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7 


8 


>7 



Table 1: Wind speed discretization 
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Figure 1: Power curve of the 10 kW Aricon HAWT wind turbine. 
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4. Reliability theory for the second order semi-Markov chain in 
state and duration 

In this section, following the research line in Barbu and Limnios [T3] and 
in Blasi, Janssen and Manca [12], we define and compute reliability measures 
for the second order semi-Markov chain in state and duration. 
Let E be partitioned into sets U and D, so that: 

E = UUD, = f/nD, U^E. 

The subset U contains all "good" states in which the system is working and 
subset D all "bad" states in which the system is not working well or has 
failed. In the wind speed model the up states are those for which the wind 
speed is sufficiently high to allow the production of energy or not excessive 
high such that the turbine should be turned off. 

In the following we present both the typical indicators used in reliability 
theory and also their application and, in order to verify the validity of our 
model, we compare the behaviour of these indicators for real and synthetic 
data. The indicators of the synthetic data are computed averaging over 300 
different trajectories. 

The three indicators that we apply evaluate are: 

(i) the point wise availability function A giving the probability that the 
system is working on at time t whatever happens on (0,t]. 

In our model we denote this function by 

xAi^k{t) = P[^^(t) e U\Jn{o) = k, Jn{o)-i = i, Tn(o) - Tn{o)^i = x], (8) 

The availability xAi^k{t) gives the probability that at time t the wind turbine 
produces energy given that at time zero the wind speed entered state k com- 
ing from state i with a duration equal to x. 

(ii) the reliability function R giving the probability that the system was 
always working from time to time t: 

xRi,kit) = F[Z^{u) eU -.Wue {0,t]\JN{o) = k, Jn{o)-i = i,TN(o)-TNio)-i = x] 

(9) 

The reliability xRi,k{t) gives the probability that the wind turbine will always 
produce energy from time zero up to time t given that at time zero the wind 
speed entered state k coming from state i with a duration equal to x. 
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Figure 2: Comparison of availability functions for real and simulated data 

(iii) the maintainability function M giving the probability that the system 
will leave the set D within the time t being in D at time 0: 

xMi^kit) = 1-P[Z2(m) E D, Vm e (0,it]|JAr(o) = k,JN(o)-i = i,TN(o)-TN{o)-i = x]. 

(10) 

The maintainability xMi^k{t) gives the probability that the turbine will pro- 
duce energy at least once from time zero up to time t given that at time zero 
the wind speed entered state k coming from state i with a duration equal to 

X. 

These three probabilities can be computed in the following way if the pro- 
cess is a second-order semi-Markov chain in state and duration of cumulated 
kernel Q = {xQi,k;j{t)). 

(i) the point wise availability function x^i.k{t) '■ 

xAi^kit) = X^X] x(pi,k;h.jii)- (11) 

hGE j£U 

In Figure [2] the availability functions of the real and synthetic data are 
compared. The comparison is made by varying the sojourn time and the 
starting state. Particularly, in Figure [2] it is plotted the availability function 

8 




for two different starting state i maintaining constant the ending state k for 
two different sojourn time x. 

(ii) the reliability function xRi,k{t) '■ 

to compute these probabihties, we will now work with another cumulated 
kernel Q = {xQi,k;jit)) for which all the states of the subset D are changed 
into absorbing states by considering the following transformation: 

{1 iii e D,k = j = i 
1 if I eU,k ED, J = k (12) 
otherwise. 

xRi,k{t) is given by solving the evolution equation of a second-order semi- 
Markov chain in state and duration but now with the cumulated kernel 

xQi,k;j(t) X Pi,k;j 'x Cri^k-j(t^ . 

The related formula will be: 

xRi,k{t) = ^ ^a;0j,fc;/i.i(^) (13) 

heu j(^u 

where x4^i,k;h.j{t) are the transition probabilities of the process with all the 
states in D that are absorbing, i.e. with cumulated kernel Q. 

Figure [3] shows the reliability function for real data compared with those 
simulated. The plotting procedure is the same as for the previous figure: we 
maintain constant the state k and we vary the state i and the sojourn time x. 
The theoretical trend of the reliability function is confirmed, the probability 
decreases at the increasing of the time interval. 

(iii) the maintainability function xMi^k{t)'- 

to compute these probabilities we will now work with another cumulated 
kernel Q = {xQi,k;jit)) for which all the states of the subset U are changed 
into absorbing states by considering the following transformation: 

{1 if ie U, k = j = i 
1 ifieD,keU,k = j (14) 
otherwise. 

xMi^kit) is given by solving the evolution equation of a second-order semi- 
Markov chain in state and duration but now with the cumulated kernel 

xQi,k;j(t^ X Pi,k;j 'x Gri^kj(t^ . 

The related formula for the maintainability function will be: 
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Figure 3: Comparison of reliability functions for real and simulated data 

where x4>i,k;h.j{t) is the transition probability of the process with all the states 
in U that are absorbing, i.e. with cumulated kernel Q. 

The maintainability function is plotted in Figure |4j As the previous 
figures, this one shows the comparison of the maintainability for the real and 
simulated data varying the starting state and sojourn time and maintaining 
constant the ending state. 

It is possible to note that all the indicators plotted above (availability, 
reliability and maintainability) depend strongly on the initial and final states 
and that there is also a great dependence on the sojourn time x. In fact, all 
the probabilities have different values if only the sojourn time x is changed 
keeping constant initial states i and final state k. One can also notice that 
the longer the wind has been in a state in one of the sets U or D in the past 
the higher is the probability that it will remain in that set in the future time. 
This shows that the future wind speed does not depend only on the present 
state but also on the past states and sojourn time giving even more evidence 
to support the semi-Markov hypothesis. 
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Figure 4: Comparison of maintainability functions for real and simulated 
data 

5. Conclusion 

In our precedent works, we presented new stochastic models for the gen- 
eration of synthetic wind speed data. In this work, instead, we compute, 
for the first time, typical indicators in rehabihty theory for wind speed phe- 
nomenon by introducing a new second order semi-Markov model. In order to 
check the validity of the presented model, we have compared the behaviour 
of these indicators for real and simulated data. To do this, we applied our 
model to a real case of energy production filtering real and simulated data 
by the power curve of a commercial wind turbine. The results show that 
the proposed model is able to reproduce correctly the behaviour of real data 
especially for short time intervals. Future works will focus on improving the 
model in order to be able to enlarge the time interval on which our model 
can reproduce accurately the behaviour of the real wind speed. 
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